Universality of REM-like aging in mean field spin glasses 
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Aging has become the paradigm to describe dynamical behavior of glassy systems, and in par- 
ticular spin glasses. Trap models have been introduced as simple caricatures of effective dynamics 
of such systems. In this Letter we show that in a wide class of mean field models and on a wide 
range of time scales, aging occurs precisely as predicted by the REM-like trap model of Bouchaud 
and Dean. This is the first rigorous result about aging in mean field models except for the REM 
and the spherical model. 
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A key concept that has emerged over the last years in 
the study of dynamical properties of complex systems, is 
that of "aging" . It is applied to systems whose dynam- 
ics are dominated by slow transients towards equilibrium 
(see e.g. [H, 0, S0| for excellent reviews). This phenom- 
ena occurs in a huge variety of systems, such as glasses, 
spin-glasses, bio-molecules, polymers, plastics, and has 
obvious practical implications in real-world applications. 

When discussing aging dynamics, it is all important 
to specify the precise time-scales considered in relation 
to the volume. On the one hand, one may study the 
non-activated regime, where the infinite volume limit at 
fixed time t is taken first, and then one analyzes the en- 
suing dynamics as t tends to infinity. This non- activated 
regime has been studied intensively for Langevin dynam- 
ics of various soft-spin versions of mean field spin glasses 

For longer time scales, that is times diverging with 
the volume of the system, a full picture is largely miss- 
ing. The slow dynamics of complex systems in such time 
scales is often attributed to the presence of "thermally 
activated" barrier crossings in the configuration space 
[j| . For instance, the standard picture of the spin glass 
phase typically involves a highly complex landscape of 
the free energy function exhibiting many nested valleys 
organized according to some hierarchical tree structure 
(see e.g. [Io|, El)- To such a picture corresponds the 
heuristic image of a stochastic dynamics that, on time- 
scales that diverge with the size of the system, can be 
described as performing a sequence of "jumps" between 
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different valleys at random times those rates are gov- 
erned by the depths of the valleys and the heights of con- 
necting passes or saddle points. The extreme situation 
here corresponds to considering time scales just before 
the equilibration time. While at these scales the relation 
to the equilibrium Gibbs distribution is most immediate, 
in many glassy systems these time scales appear to be 
beyond experimental or numerical reach. 

In this letter we show that the mechanism for aging 
is universal for a class of Glauber dynamics of p-spin 
Spin Glasses (with p > 3), out of equilibrium, in a wide 
range of time scales. These time scales are exponentially 
long but still much shorter than the time scales needed 
to reach equilibrium. Thus this mechanism is essentially 
a transient one, linked to the exploration of the energy 
landscape long before the dynamics can feel the ground 
state. This mechanism has been first established for the 
simple case of the REM, hence our title. 

To capture the features of activated dynamics, 
Bouchaud and others [l], HI, 0, 0, HH introduced an in- 
teresting ansatz, that is a mapping of the dynamics onto 
"trap models" . These trap models are Markov jump pro- 
cesses on a state space that simply enumerates the valleys 
of the free energy landscape. While this picture is intu- 
itively appealing, its derivation is based on knowledge 
obtained in much simpler contexts, such as diffusions in 
finite dimensional potential landscapes. Mathematically, 
trap models are continuous time Markov chains whose 
state space is a (infinite or growing with some parameter) 
graph (e.g. Z d ). To the vertices (= traps) of this graph 
one associates independent random variables whose com- 
mon distribution is assumed to be heavy tailed, that is 
their mean is infinite. These variables represent the mean 
waiting times of the Markov chain in the corresponding 
trap. 

Trap models involve the ad-hoc introduction of three 
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major features that ultimately need justification. This is 
the independence of the waiting times associated to the 
traps, the heavy-tailed nature of their distribution, and 
finally the Markov pro per ty of th e dy namics. 

In a series of papers flU ll7L [l8l Ojf (see [2(| for a com- 
prehensive review) a systematic investigation of a variety 
of trap models was initiated. In this process, it emerged 
that the " slow-down" of the dynamics appears to be uni- 
versal for these trap models (except in the exceptional 
dimension 1), and, more precisely, that it has a scaling 
limit given by an a-stable subordinator. Equivalently it 
was shown that the classical Dynkin-Lamperti picture for 
heavy-tailed renewal processes is universally satisfied for 
these trap models (in dimension larger than 1). 

In contrast, very little has been done concerning the 
derivation of trap-model dynamics from stochastic dy- 
namics of even moderately realistic spin-glasses, such as 
the p-spin interaction SK models. The only case where 
this has been achieved so far is the simplest of these mod- 
els, the Random Energy Model (REM) of Derrida with 
a particular form of the transition rates. In [U [22|, HH 
this was achieved by a very detailed analysis of the dy- 
namics at time scales just before the equilibration time, 
and at temperatures below the critical one. This result 
relied, in particular, on the detailed understanding of the 
equilibrium distribution of this model. More recently, in 
[l8|, the same model was analyzed at much shorter (but 
still exponentially large) time scales. It emerged that the 
same aging mechanism is in place there and that aging 
can also occur above the critical temperature. 

All these works made crucial use of the independence 
of energies of different spin configurations assumed in the 
definition of the REM. In the present letter we present 
the first rigorous aging results in a model with correlated 
energies, the p-spin interaction Shcrrington-Kirkpatrick 
(SK) model of spin glasses withp > 3. Quite surprisingly, 
the results obtained point again to the validity of the 
REM-like trap model as universal aging mechanism. 
The p-spin SK model. We recall that the p-spin SK model 
is defined as follows. A spin configuration a is a vertex 
of the hypercube Sn = {— 1, 1}^. The Hamiltonian is 
given by 

1 N 

H N {a) = ~ N{p _ 1)/2 Jii-ipVii •••Tip) (1) 

n *p=l 

where J^..., are independent Gaussian random variables 
with mean zero and variance one. Alternatively, we can 
describe the Hamiltonian as a centered normal process 
indexed by Sn with covariance 

E[H n (o-)H n (t)} = NR n {<t,t) p , (2) 

where Rn(o~, t) denotes as usual the normalized overlap, 
Rn(&,t) = N^ 1 J2iLi (J i T i- We define a random Gibbs 
measure on Sn, M/3,JV = ZZ N e~^ Hn ^ a '. Note that in the 
limit p T oo one recovers the random energy model [24|, 
where Hn(o~) are i.i.d. Gaussian random variables with 
variance N . 



Dynamics. We consider a continuous time Markov dy- 
namics o~N(t) on Sn whose transition rates are 



Pn{(J,t) = N~ 



-l e /3H N (a) 



(3) 



if a and r are related by flipping a single spin, and are 
zero otherwise. It is easy to see that this dynamics is 
reversible with respect to the Gibbs measure ///j,jv- One 
also sees that it represents a nearest-neighbor random 
walk on the hypercube with traps of random depths. 

It is thus useful to look at this dynamics as at a time 
change of a simple unbiased discrete-time random walk 
Yjv(fc), k € N, on Sn started out of equilibrium^^ at 
some at some fixed point of Sn, say at {!,...,!}: We 
define the clock-process by 

k-l 

S N (k) = J2e iex p{-l3H N (YN(i))}, (4) 

z=0 

where (ei,i £ N) is a sequence of mean-one i.i.d. expo- 
nential random variables. Then <7/v(i) can be written as 



a N (t)=Y N (S N \t)) 



(5) 



S'jv(fc) is the instant of the fc-th jump of <jjv(i). 
The REM-like trap model. The idea suggested by the 
known behavior of the equilibrium distribution is that 
this dynamics, for (3 > (3 C , will spend long periods 
of time in the states a^\a^ 2 \... etc. and will move 
"quickly" from one of these configurations to the next. 
Based on this intuition, Bouchaud et al. [l^, EH pro- 
posed the "REM-like" trap model: Consider a continu- 
ous time Markov process Zm whose state space is the 
set Km = {I, . . . , M} of M points, representing the M 
"deepest" traps. Each of the states is assigned a ran- 
dom variable (representing minus the energy of the 
state fc) which is taken to be exponentially distributed 
with rate one. If the process is in state fc, it waits an ex- 
ponentially distributed time with mean proportional to 
e@ £k , and then jumps with equal probability in one of the 
other states fc' € Km- 

The quantity that is used to characterize the "ag- 
ing" phenomenon is the probability HM(t, s) that dur- 
ing a time-interval \t, t + s] the process does not jump. 
Bouchaud and Dean fijl showed that, for f3 > 1, 



lim lim 



ii M (t,s) 



s,t\oo Affoo H 1 /p(s/t) 

where the function H a is defined by 

_ sin(7ra) f°° dx 



f, 



H a (w) 



(I + x)x a ' 



(6) 



(7) 



The dynamics of the REM-like trap model can be seen 
as a time change of a simple random walk Ym on the 
"complete graph" K M by the clock process, §M{k) — 



12i=o ei ex P{fl £ Y M (i)}- As explained in [18], the result 
© can be deduced from the stronger claim 

lim lim n- S M (nt) = cV 1/j3 (t), t > 0, (8) 

n|oo Mf oo 
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where V a (t) is the a-stable subordinator (increasing 
Levy process) with Laplace transform ¥\e~ XVa ^] — 
exp(-iA Q ). 

The REM. In [HI, [H [H[ it was confirmed that the REM- 
like picture is correct, at least for the dynamics defined 
in ([3]). This result was further extended to shorter time 
scales in [l8[ where the point of view of (JSJ) was put in the 
foreground. Namely, it was shown that the clock process 
converges again to the stable subordinator: For every 
< g < 1, if p e = /3/y/g > P c = V2hT2, 7 = /ty2£ln2, 

hm e- lN N^ts N {t2 Ne ) = cVp c/0e (t). (9) 

This implies then a similar aging result as in ([5]), 

n(te 7jv ,se 7jv ) H 0c/ p e (s/t), as in the REM-like 

trap model for the probability IIjv(£, s) that <7jy(t) does 
not jump between t and t + s. 

Note that this result has an interesting interpreta- 

_ p„ 

tion: at a time-scale e 7 N ^ the process succeeds to 
make 2 eN steps, that is it explores a subset of configu- 
ration space that corresponds to a "little REM" in vol- 
ume n — gN. At this time scale, the process feels an 
effective inverse temperature (3 e . If the effective temper- 
ature is below the critical one for the standard REM, the 
system shows aging, otherwise it does not. It may seem 
somewhat counterintuitive that the systems is 'effectively 
"warming up" as time goes by. 

Let us discuss the heuristics of this result. When the 
random walk has made 2 eN steps, with g < 1, it has 
only explored a small fraction of the total configuration 
space. In particular, it has not had time to find the ab- 
solute minima of Hn, hence it is still out of equilibrium. 
Moreover, the random walk will essentially not have vis- 
ited any configuration twice. Therefore, the minimum 
of Hn along on those configurations that were visited is 
the minimum of 2 gN independent Gaussian random vari- 
ables of mean zero and variance N. It is well know (see 
e.g. HH) that this is of the order Ny/2g\n2. Then 
the mean waiting time in this extreme trap is of order 
exp(/37Vv / 2ghi2) = e lN , up to a polynomial correction. 
Now the condition that P e > f3 c implies that this time is 
of the same order as the total time the process has ac- 
cumulated in all the other sites along its way, and, more 
precisely, the process will have spent all but a negligible 
fraction of its time in the "few" "deepest trap" . Again 
standard results of extreme value theory imply that the 
precise statistics of the times spent in the deepest traps 
are asymptotically governed by a Poisson process, and 
that the sum of these random times, after rescaling, con- 
verge to a stable subordinator, as claimed. 

p-spin models We will now present our main results for 
the p-spin SK model. The full proofs of these results 
are given in (26[. First, since the valleys in the free- 
energy landscape contains more than one configuration, 
we should change the two-point function II. We set 

W N (t,s) ^P{R N (a N (te^ N ),a N ((t + S )e^ N )) > 1 - e}, 

(10) 



that is the overlap at two far-distant time instants is ex- 
ceptionally large. Then, a similar result as in the REM 
holds, at least if p > 3. Namely, there is a p-dependent 
constant g p , such that , if g and (3 satisfy the conditions 

p e = p/y/g > V2W2 = p c 
g < g p , 

then, for any e € (0, 1), t > 0, and s > 0, 

kn E ff (i )S ) = ff We ( s /(), (12) 

The basis of this result is again the statement analogous 
to ([9]) that shows that the properly rescaled clock-process 
converges to a stable subordinator. 

The function g p used in (jlip to limit the considered 
time scales is increasing and it satisfies 

£>3 ~ 0.763 and lim g p = 1, (13) 

p]' oo 

hence in the limit p f oo we recover the result for the 
REM. 

Note the role of the two restrictions on P and g. The 
first one is again the statement that the effective tem- 
perature at the time scale considered is below the critical 
one. The second condition is related to the correlation 
of the energies. It implies that the REM-like behavior 
holds only up to time scales where the explored region 
is still so small that the process does not feel the cor- 
relations; essentially it ensures that the process does not 
have enough time to get close enough to a point it visited 
before so that it is able to feel the correlations. 

The heuristic justification of the results in the p-spin 
model is rather similar to that of the REM. The difference 
here is that the energies at the sites that the walk has 
visited are correlated. Our assertion is that under the 
condition g < g p , this has only a mild effect and does 
not change the overall picture. The reason for this relies 
on the geometric properties of typical trajectories of the 
random walk on the hypercube, and on the extreme value 
properties of correlated Gaussian processes. First, it has 
been shown (see e.g. [27ll28ll29|) that if p is larger than 2 
and if K e is a totally random subset of the hypercube Sn 
of cardinality 2 eN , with g sufficiently small (depending on 
p), then the extremal process of Hm restricted to K e are 
the same as if Hn{o~) were independent random variables. 
Note that this is not true in the standard SK model with 
p = 2 which is the reason our results can be expected 
only for p > 3. 

Now it is clear that the trajectories of the random walk 
cannot look exactly like a totally random set, after all the 
trajectory is connected, while in K g essentially all points 
are isolated. However, a detailed analysis of the random 
walk reveals that its trajectories look very much like a 
random set K g with linear pieces between them joining 
the points up in a minimal way. Hence, the correlations 
have some impact only very locally in time, implying 
in particular that deep traps will not be made of single 
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points but consist of a deep valley (along the trajectory) 
that has approximately the same depth and whose shape 
and width we can describe quite precisely. Remarkably, 
each valley is essentially of a size independent of N (that 
is the number of sites contributing significantly to the 
residence time in the valley is essentially finite), and dif- 
ferent valleys are statistically independent. 

The fact that traps are finite may appear quite surpris- 
ing to those familiar with the statics of p-spin models. 
From the results there (see [13)1131), one knows that the 
Gibbs measure concentrates on "lumps" whose radius is 
of order Ne p , with e v > 0. The mystery is however solved 
easily: Around a local minimum <to with Hn((To) ~ 
— 7-/V//3, the process Hn{(j) does grows essentially lin- 
early with the distance d(ao , cr) from the minimum, 
E[_ffjv(er) — Hjy(<T )] ~ c(p, 7,/3)d(ao, <t). Therefore, the 
Gibbs mass decreases exponentially with g?(<7o , a) . For 



the support of the Gibbs measure, one needs to take into 
account the entropy, that is that the volumes of the balls 
of radius r increases like exp(iV(ln2 — 1(1 — r/N))). For 
the dynamics, at least at our time-scales, this is, how- 
ever, irrelevant, since the simple random walk leaves a 
local minimum essentially ballistically. 

Remark: We conclude the Letter with a remark on the 
role of the particular choice of the transition probabilities 
(J3j) depending only on starting points. Clearly these favor 
the proximity to Bouchaud's model. For us, on a techni- 
cal level, the independence of the random walk trajectory 
of the random environment defined by the Hamiltonian 
is crucial. Even in the case of the REM, we do not know 
at this point how to deal with different, and more usual, 
dynamics such as Metropolis or heat bath. This problem 
remains one of the great challenges in the field. 
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